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ABSTRACT 



Oomputerized Adaptive Testing: A Comparison of tltc Nominal 
Response Model and the three Parameter Logistic Model 



R. J. DeAyala and William R. KbcH, The University of Texas at Austin 



_ _ _ A nominal resfwDrisc mpdcKbased cb adaptive testing prbccdufic (nbmirial C ATI w is im- 

piemented using simulated data. Ability estimates from the nominal CAl' were compared to those from a 
CAT based upon the threc-pararhetcr lo^stic model (3PL CAT). Furthermore, estimates from both CAT 
prbcedures were compared with the knbvvTi true abilities used to generate the sirhulated data. Results 
showed that the nominal CAT's ability estimates were highly correlated with those of the 3PL CAT as well 
as with the true abilities, iurttermore^^the nominal CAT had a significantly hi^er association with nega- 
tive true 0s than did the 3PL CAT arid it also had significaritly lower st^^^ errors of estimate than did 
the 3PL_CAT. Hqweve^^ the nominal model-based CAT had difficulty estimating positive 08 and had a 
poor convergence rate. In contrast^ the 3Pb CAT had a hi^ convergence rate and its performance was 
hot affected by whether the true abilities were positive or negative. Potential reasons for the nominal CAT's 
high nonconvergence rate as well as implications for computerized adaptive testing were discussed. 
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Compuiertzed A d^^ Tisiirtg: A Comparison of the Nominal 
Response Model arid the Three Parameter Logistic Model 

OBJECTIVES 

The objectives of this fesearch were (a) to develop and implement a computerized adaptive testing 
(CAT) procedure based on Bock's (1972) hbmirial response model arid (b) to compare the nominal-based 
eAT's performance in ability estimation with that of a more traditional three-parameter logistic (3PL) 
model-based CAT. 

Gomputenzed adaptive testing methods have primanly been based on two item response models : the 
three-parameter and dnc-pafametcr logistic models. Because both these models are based on 
dichotomously scored item responses, they ignore potentially useful information about the examinee con- 
tained m incorrect responses. The existence of iriformatibri Iri Iricorrect responses has been demonstrated 
in several studies, e.g., Levine and Drasgow (1983), Thisscn (1976). 

Given the existen.:^e of information in incorrect responses as well as itcrri respbrise models which can 
access this information, it is reasonable to try to utilize this information for CAT ability estimation. Spe- 
cifically, the ability estimate for an examinee who has enough partial knowledge to select an alternative 
which is almost correct should be higher than the ability estimate for the examinee who selects an incorrect 
alternative which is attractive brily to the examinee of very low ability. By utilizing the information in in- 
correct responses one can administer items which are more appropriate for the examinee who responded 
incorrectly. Tlie assessment bf this partial knowledge requires a polychbtombus model. 

The potential advantages of a nominal-based CAT are the capahiiity of using items in a CAT system 
which db riot rieed to be scored dichotombusly and the possibility of greater efficiency in ability estimation 
due to using all the information provided by the examinee's response. 
METHOD 

Programs : Two CAT programs were written, one program was based on the 3FL model (called 3PL CAT), 
whereas the other was based on the nominal response model (called the nominal CAT), IBoth programs 
used maximum likelihood estimation of ability arid iteril Selection was ori the basis of iterri inforriiatibn. 
The adaptive testing simulation was terminated when cither of two criteria were met : a maximum of thirty 
items was reached or when there were rid rribre iterris iri the itcrri pool which had at least a predetermined 
minimum amount of information. The initial ability estimate for an examinee was the thcta corresponding 
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to the mode of the total test informatibri distribUtibh. Each CAT prbgiam was run on a isimulated data set 
where the ©s which produced the response stnngs were known. 

Data : A data set of 1093 exmnineiss was created frbiti five adttiihistratibhs bf The Cblicgc Board's' 
Achievement Test in Mathematics, tevel I, at the University of Texas at Austin. This data set (called the 
original-' 1093) contained bnly individuals whb amswercd at least 80% of the questions arid the last question. 

To work within the constraints of the calibration pro-am, MUbTlfcOG 5 (Thisscn, 1986), one item 
was omitted from the data set and the 5-chdicc items of the Math Level I test were collapsed into 4-choice 
items. Tte item omitted had the lowest joint traditional difficulty and discrimination values of all the items 
on the test. MULTILOG 5 was used tb calibrate the data for bbth the 3PL arid ndrriirial response models. 
It should be noted that the above constraints in MtJbTlLdG arc currently being removed. 

Two simulated data sets were created based bri the item parameters fi*bm the calibration of the 
original- 1093 data set. The item parameter estimates for both the 3PL and nominal response model from 
the test were dbubled tb produce a 98-itcm pbbl for each model. By sampling from a normal distribution, 
1000 examinees (their z-values were considered to be true 0s) were generated. These true 0s plus the 98 
iteiti parameter estiriiates for the 3PL and for the nominal response riiodcl were used to generate binary and 
non-binary response strings with a random error component for each simulated examinee, respectivejy. 
The former of these generated data sets will be referred to as Sim-3PL and the latter as Sim-Nom. Each 
simulation data set was subsequently calibrated to obtain iterri parameter estimates to be used in the simu- 
lated data set CAT runs. 
RES UL TS A ND CONCL USIONS 

Table 1 shows the mean and standard deviation (S.D.) of the number correct (NC) score and coclfi- 
cient alpha (alpha) for the various data sets and Table 2 presents the principal axes analysis of these data. 
As can be seen from these tables, all the data sets may be cbnsidcrcd tb be highly reliable. Further, although 
none of the data sets had a first factor which accounted for a large perccntigc of the total variance, each 
data set's first factor did account for a large proportion bf the cbmmon variance. It was concluded that the 



1 TTie authors wash to thank The Coliege Board for granting permission to use the Mathematiccs Level I Achieve- 
ment test data. 
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data sets did not seriously violate the anidimcnsionality assumption and that the simulated data had dcisir- 
able properties. 

Insert Table I about here 



Insert Table 2 about here 



Table 3 presents statistics on the item pool information for the 0 range -4.6 to 4.6 for the simulation 
data sets. From the literature on the nominal response model (e.g., Bock, 1972; Thisscn, 1976), it was ex- 
pected that the nbmihal model would provide more information than the 3PL model for 0s less than 0.0. 
The entries for the nominal response model in Table 3 as well as Figure 1 arc consistent with this belief. 
Although the simulation data sets did riot coiitairi extremely irifonriative iterris, the nominal response rriodcl, 
provided, on the average, twice as much information per item for the sirnulated data as did the 3Fb model 
(0.20 and 0.10, respectively). In fact, the total information computed over the -4.0 to 4.0 0 range for the 
ribriiinal response model was tv^ice that of the 3PL model (Total Information = 1637.2 and 806.99, re- 
spectively). However, at approximately 0 = 1.5 and above the 3PL model provided more information than 
the nominal model. 



Insert Table 3 about here 



Irisert Table 4 about here 



Tables 4 and 5 present the arialysis of the sirilulated data^s estiriiated 0s for cbrivergerit cases for the 
3Pfc CAT and nominal CAT, respectively. As these tables show, estimated 0s from both CATs arc highly 
related with and accounted for 88% of the variability of the true 0s; Pearson Product CbefTscierits (r) were 
r = 0.94 for the 3PL CAT and r = 0.94 for the nominal GAT. These tables also show that nonconver- 
gence of the 0 eistimates was a problem with the nominal CAT, but not with the 3PL CAT. Inspection 
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of the estimated 0 range for the two CATs for corresponding convergent cases (1 able 6) shows that they 
shared approximately the same lower bound, but their upper bound difTercd considerably, with the nominal 
eAT's upper bound being less than that of the 3PL, 2.(53 vs; 3;80, respectiveiy. 

Insert Table 5 about here 



Insert Table 6 about here 



Comparing the 3PL CAT and the nominal CAT for those 0 estimates which were convergent in both 
CATs (jointly convergent), one sees that the estimates from each CAT were highly related with another, r 
= 0.85, although they were significantly difTererit from brie another (t = -6.6, df= 999, p = 0.0001; see Table 
6). Furthermore, for these jointly convergent cases Tables 7 and 8 show that each CAT demonstrated a 
great deal of association with the true 0s, r = 0.91 for the 3PL CAT and 0.93 for the nominal CAT. 
Furthr^r, the 3PL CAT 0 estimates were not significantly different from the true 0s, although the nominal 
CAT 0 estimates were significantly difTerent frorh the true 0s. 



insert Table 7 about here 



Insert Table 8 about here 



Analysis of the CATs' 0 estimates for true 0s above and below 0 are preserited in Tables 9-18. 
As can be seen frorri Table 9 for the 3PL CAT using its convergent cases orriy, the average 0 estimate was 
significantly different from the true 0s' average, although the two were sigriificaritly related (r = 0.89). This 
same pattern occurred for the nominal CAT for its convergent cases (Table 10) and also when the jointly 
convergent cases were used (Tables 1 1 and 12). The 3PL and nominal CATs' 0 estimates for the jointly 
convergent cases were only moderatedly related (r = 0.62) and were significantly difTerent from onr another 
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(Table 13); Further, the association between the 3PL CAT's 0 estimates and their true values wa signif- 
icantly greater than that of the nominal CAT's (t 2.33, p = 0.6 df=263). 



Insert Table 9 about here 



Insert Table 10 about here 



Similar results were obtained for the true 0s Below zero for the 3PL md nominal CATs using their 
respective convergent cases (Tables 14 and 15). IlbwcVer, when drily the jointly cbriVcrgcrit cases were used, 
neither the 3PL CAT hdr the nominal CATs' average 0 estimate were si^ificantly diflcreht from the av- 
erage true 0 (Table 16 and Table 17); both CATs' © estmatcs were significantly related to the true 0s (3PL 
CAT : r = 0.84; nominal CAT : r = 0.95). Further, from Table 18 it can be seen that the 0 estiriiates 
from the two CATs were not significantly difierent from each other and were highly related to one another 
(t = 0.34, p = 0.74, df = 265; r = 0.78). A significance test for the correlations of each CAT's 0 estimates 
with the true 0s showed that the nominal CAT was significantly more strongly related with the true 0s than 
was the 3PL CAT (t = -13.68, p = 0.0005, df = 442). 



Insert Table 1 1 about licic 



Insert Table 12 about here 



In short, on the basis of the 0 csts mates/true 0s' correlations as wcU as from the similarity of the 0 
estimates' means and the true 0s' nican, both the 3PL CAT arid the nominal CAT performed better for 
true ©s below 0 than for those above 0. 



Insert Table 13 about here 
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Table 19 shows descriptive statistics bh the standard error of the estimate of © (SEE) for Sim-3PL 
and Sim-Nom. In all cases except one, the average SEE and the mode SEE for the ndminal CAT was 
smaHer than that of the 3Pb CAT. 



Insert Table 14 about here 



Figures 2 and 3 show plots of the SEE against the true 0s for the jointly convergent cases of the 3PL 
GAT and the nominal 6AT, respectively. As can be seen from both these figures, the smallest SEEs are 
to be found around the peak of thie total test information for each CAT's item pool. As one proceeds away 
from each pool's respective points of maximum information, the SEEs increase. In general, it appears that 
even with extreme cases, the nominal CAT has lower SEEs than docs the 3Pl. CAT. A t-tcst showed that 
the nominal CAT's SEEs were significantly lower than those of the 3PL CAT (t = 40.5, p = 0.0001). 



Insert Table 15 abotit here 



Figures 4 and 5 show plots of the diffcrchces between the true 0s arid each CAT's cstirhated 0s for 
the jointly convergent cases. From Figure 4 it can be seen that the plot of these differences for the 3PL 
CAT is relatively symmetHcaHy distributed about a difference of 0, regardless of the true 0 value. In con- 
trast. Figure 5 shows a parabolic pattern for the differences computed for the ndminal CAT. As can be seen 
from this figure, the nominal CAT tends to underestimate extreme true 0s and overestimate true 0s (i.e., 
in absolute tenris) in the range -1.25 to 1.25 (apprbxirriately). The bvercstirriatibri bf 0 is believed tb be a 
result of administering items which although they met the minimum information criterion contained little 
informatibri and resulted in a dcgradatibh bf the 0 cslirriatc. Fbr instance, suppose five iterris were ad- 
ministered which had information valucv of 1.7, I a , 1.8, 1.3, and 1.1 for a 0 estimate = 0.5. However, with 
a minimum information cutoff set at 0.20, another, say, ten items with information values in the 0.20 to 
0.30 would be administered. Because these additibhal iterris cbntairi very little informatibn abbut the 0 iri 
question they would, In all probability,introduce degradation in the ability cstirnate. Therefore, it appears 
that the minimum information cutoff for item selection may have been set to low. 
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Insert Table 16 about Here 



Analysis of the true 0s for this ribhcbrivcrgerit ribmirial CAT cases (Figure 6) showed that the majority 
(93.8% of 276 cases) of these cases fell above 0.25 and 56.5% occurred above 0.75. This cdrrespdnds with 
the abbve rrieritibried analyses bf 0 estimates arid their true values when the true 0s where above and below 
d. In short, the nominal CAT experience a great deal of trouble estimating 0s in the positive true 0 range 
arid virtually rib trbuble estiriiatirig negative true 0s. ITic average and mode number of items administered 
for these nonconvergent cases was 3. 



Insert Table 17 about here 



Given the rribde riurriber of iterris adrriinistcrcd for the nominal CAT, the nonconvergent cases can 
be attributed to premature terminatioii of the CAT program. This premature tentiiriatibri is believed tb 
be a result of two factors: The first factor is a lack of appropriate items to be adininistcrcd and is a reflection 
bf the Ibw inforriatibri content of each iterii pbol. A second factor potentially responsible for premSure 
termination may be the algorithm used for providing a 0 estimate when MLE was riot possible. TTic 
riiethbd used was tb calculate for all 0s in a given 0 range the probability of responding in the categoiy the 
examinee chose. TTie new 0 estimate for the cxarriiriee was the 0 correspbriding to the largest prbbability. 
This method was used until it was possible to perform MLH, i.e., the examinee responded in a different 
category. 



Insert Table 18 about here 



Although in the design of the nominal CAT alternative techniques for dcajlng with this spcciaj case 
were considered, it was felt ihzt the abbve riientibhed technique used the pbtcntial of the nominal model 
to a ftiHer extent than the other methods considered. Given the ndncdnvcrgerice problem this decision rriay 
not have been the best. 
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Insert Table 1 9 about here 



To summarize, both CATs' 0 estimates were found to be highly related to the true 0s and to one 
another. It was found that the 3PL CAT had no systematic tendency to either under- or overestimate the 
true ©s. In contrast, the nominal CAT tended to underestimate extfcmc true ©s arid overestimate tfiie 
0s in the range 0.5 to 1.25. In general, both CATs performed better for true 0s below 0 than for those 
above 0. For negative true 0s and using either the CAT's corresponding convergent cases or the jointly 
convergent cases, the nominaj CAT had a stronger association with the tnie 0s than did the 3PL CAT. 
Further, for tiiie values less than 0 both CATs' mean 0 estimates were not significantly different from either 
each other or from the true values' mean. Analysis of the standard errors of the 0 estimates for each CAT 
showed that on the average this hbitiinaJ CAT had lower SEEs than the 3PL CAT, 

Nonconvergence of 0 estimates was a far greater problem for the nominal CAT than for the 3PL 
CAT. Analysis of the nonconvcrgent cases for the nominal CAT reveajcd that the ridmiriaj CAT had dif- 
ficulty with positive tnie 0s and virtually no problem estimating negative true 0s. The nbricbrivcrgerit cases 
for both CATs were attributed to premature termination of the CAT as a result of the low infomation 
content of the items in the pool as well as the ajgorithm used for providing a 0 estimate when rib MLE 
was possible. Given the total test iriformatibri for each simulated data's item pool, it was expected that 
nonconvergence would be a sa-e?^ter problem for the 3PL CAT than for the nominal CAT. Future research 
may identify alternative explariatibri(s) or sblutibri(s) fbr the ribncbrivergcrit prbblcm with the nbriiihal 
CAT. 

EDUCA TIONAL IMPORTANCE 

A nominal model-based CAT would allow the use of both polychotomous and dichotomously scored 
items. These polychotomous items represent a new domain of itcrris which may be used iri adaptive testirig 
erivirc*?.aerits. Further, new arid iriribvativc item types may be developed and applied throu^ a nominal- 
based CAT. These new item types may provide a rilcaris of rilcrgirig cbrriputer assisted iristructibri with 
computerized adaptive testing. 

Although the nominal model docs not require scoring the cxamiriec's respdrisc, there still riiay exist 
the tendency to guess. This teridericy to guess will riiost likely be exhibited when inappropriately difficult 
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items are administered to the examinee. Under the nominal CAT procedure, examinees who select different 
incorrect alternatives will receive different ability isstiriiates arid therefore wili riidst iikeiy be adriiiriisiercd 
differerit next items. This is riot possible with a CAT based bri a dichotbmdusly scored model. By pro- 
viding more accurate ability estimates for persons who respond incorrectly, items may be selected which are 
ihdre approprbpriate for the exarriiricc's currcrit ability estimate. Tliis will probably result in a rcductibn 
of the likelihood of guessing as well as less anxiety for the examinee. 
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Table J : Descriptive Statistics of data sets - _ _ _ _ 

Data Set N litems # Dpt. Mean No. Correct S.D. Alpha 

Origmal 1093. 1093 A9 4 27.1 7.4 0.85 

Sim-3Pb 1999 98 4 56.6 14.7 9.92 

Sim-Nom 1000 98 4 59.8 15.4 0.92 



Table 2 : Factor Analyses 



Data Set 
Onjirial 1093 
Sim-3Pb 
Sim-Nbrh 



Factor 1 Var, Acctd 



N 


# Items 


#Dpt. 


Facior 1 


Factor 2 


Total Cbjmmbh 


1093 


49 


4 


7.23 


1.31 


14.8% 75.8% 


ieee 


98 


4 


11.76 


2;06 


12.0% 85; 1% 


1000 


98 


4 


22.42 


6.17 


22.9% 70.5% 



Table 3 ; Test Infofmatipn for item pools - Simulated data sets, 98 items for the 0 range -4.0 to 4.0. 



Data Set 0 Peaked S.D. Min Max Avg Info/Item Total Info 

Sjm-3PL 1.8 146 1.52 35.04 D.ID M6.99 

Sirri-Nbm 0.4 13.63 2.30 41.30 0.20 1637.2 

Tabic 4 : O Analysis from CAT run on simulated data. 3PL convergent cases only, N = 976 

Data Set 0 Mean S.D r 1 p Mia Max 

Sim-3PL 0.12 1.20 0.94 -4.72 0.0001 -3.94 3.80 

True 0s 0.06 0.99 -2.86 3.23 

Table 5 : 0 Analysis from CAT run on sirriulated data. Nominal convergf^nt cases only,_N= 724 _ _ 

Data Set 0 Mean S.D r t p Min Max 

Siffl-^Nom -0.13 1.31 0.94 -7.47 O.OOOl -3.98 2.63 

True©s 9.29 9:87 -2.86 2.70 



Table 6 : 0 Analysis from CAT run oh simulated data, cbmparisbh of both models jointly cbhvergcht cases 
only, N=^711 ^ - 

Data Set 0 Meir S.D r t p Miit Max 

Sim-3PL -0.25 1.03 0.85 -6.66 0.0001 -3.94 3.8D 

Sim-Nom -0.08 1.27 -3.98 2.63 



Table 7 : 0 Analysis from CAT run on simulated data, iVL & Nominal jointly convergent cases only, 



N = 711 _ 

Data Set 0 Mean 5 .D r t p Miri Max 

Sim-3PL -0.25 1.03 0.9 1 -0.85 0;3930 -3.94 3.80 

True 0s -0.27 0.84 -2.86 2.70 



Table 8 : 0 Analysis from CAT run on simulated data, 3PL & Nominal jointly convergent cases drily, 
N-711 

Data Set 0 Mean S.D_ _ __r i p_ Min_ Max 

Sim-Nom -0.08 1.27 0.93 -8.48 0.0001 -3.98 2.63 

True©s -0.27 0.84 -2.86 2.70 



Table J> : 0 Analysis from. CAT run on simulated data, .3PI. convergent cases only; true 0s GE 0.0, N = 521 
Data Set 0 Mean S.D r t p Miri Max 

Simulated 3PL 0.94 0.81 0.89 -9.60 O.OOOl -0.92 3.80 

true 0s 0.78 0.62 0.0 3.23 



Table 16 : 0 Analysis from CAT ran on simulated data, Nominal convergent cases only; trae 0s GE 0.0, 

N = 267 _ _ 

Data Set 0 Mean S.D r t p Min Max 

Simtalated Noininal 1.11 0.48 0.71 -22.25 0.000 1 0.24 2.63 

True 0s 9,55 9.57 0.0 2.70 
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Table 11:0 Analysis from CAT run on simulated data, 3PL ft Nominal jointly convergent cases only; true 
_ _ ®s GE fl.e, N = 266 _ _ _ 

Data Set 0 Mean S.D r t p Miri Max 

Simulated 3jPL 0.65 0.75 0.86 4.11 0.0001 -0.92 3.80 

True 0s 0.55 0.56 0.0 2.70 

Tsible 12 : 0 Analysis fronn CAT run on simulated data, 3PL & Nominal jointly convergent cases only; for 
true 0s GE D.O, N = 266 

Data Set 0 Mean S.D f t p Min Max 

Simulated Nominal MO 0.48 0.7 1 -22.52 0.000 1 0.24 2.63 

True 0s 0.55 0.56 0.0 2.70 

Table 13:0 An^sis from CAT run on simuiatcd data, comparison of both models jointly convergent 
_ _ cases Dnly; true 0s GE 0^0^N = 266 

Data Set 0 Mean S.D r t p Miri Max 

Simulated 3PL 0.65 0.75 0.62 -12.75 0.0001 -0.92 3.80 

Simulated Nomind 1.10 0.48 0. 24 2.63 

Table 14 : 0 Analysis from CAT run on simuiatcd data, 3PL convergent cases only; tnie 0s LT 0.0, 
N = 455 

Data Set 0 Mean S.D r t p Min Max 

Simulated 3PL -0.82 0.80 0.84 2.17 0.03 -3.94 0.95 

True ©s -0.78 0:59 -2.86 -0 002 

Table 15 : 0 Analysis from CAT nin bri simuiatcd data, Nominal convergent cases only; tnie 0s LT 0.0, 
N = 457 

Data Set 0 Mean^ S.D r t p Min Max_ 

Simulated Nominal -0.85 1.07 0.95 2.75 0.006 -3.98 0.622 

true 0s -0.78 0.59 -2.86 -0.002 

Table 16 : 0 Analysis frorii CAT run bh sirhulated data, 3PL & Nominal jointly convergent 

cases only] tnie 0s LT 0.0, N = 445 
bata^et -- 0 Meart S.D r — t p Mm Max 

Simulated 3PL -0.79 0.77 0.83 1.77 0.08 -3.94 0.95 

true 0s -0.75 0.56 -2.86 -0.002 

Tsible 17 : 0 Analysis from CAT run bh simulated data, 3PL & Nominal jointly cbhvergerit cases only; true 
0SLTO.O, N = 445 

bata5et ©Mean S.D r t p Min Max 

Simulated Nominal -0.80 _ K03 0.95 1.90 0.6 -3.98 0.62 

True 0s -0.75 0.56 -2.86 -0.002 

Table 18 : 0 Analysis from CAT run on simuiatcd data, cbrnparison of both models jointly convergent 

cases bnlyi tnie 0s LT 0.0, N == 445 
Data Set ©Mean S.D r t p Min Max 

Simulated 3PL -D.79 0.77 0.78 0.34 0.74 -3.94 D.95 

Simulated Nbmirial -0.80 1.03 -3.98 0.62 

Table 19 : Descnptive Statistics : Standard Brrors from 0 estimates for GAT runs with simulated data sets. 
Model for CAT N Mean Mode S.D. Min Max 

3PL^ 976 0.38 0.40 0 09 0.18 1.09 (con vergent cases) 

Nominal 224 0.24 0.19 0.11 0.19 1.05 (cojivergcnt cases) 

3PL 711 0.40 0.40 0.08 0.18 1.09 (joint cbrivergehcc) 

Nominal Hi 0.23 0.19 0.10 0.19 1:05 (joint convergence) 
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^ Figdre 1 

3PL & NOMINAL MODEL TOTAL INFO FOR SIM DATA 




• Figure 2 ' 

S. E. FOR 3PL;J0INTLY CONVERGENT CASES 




Figure 3 



S.E. FOR NOMINAL;JOINTLY CONVERGENT CASES 
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^ Figure 4 

TRUE THETA - 3PL EST.; JOINTtY CONVERGENT 
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Figure 5 

TRtiS THETA - NOMINiyL EST.; JOINTLY CONVERGENT 




. * Figure 6 

FREQ OF TRUE THETA FOR NOMINAL NONGON^RGENT CASES 
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